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TREE-BASED LANGUAGE COMPLEXITY OF THOMPSON’S GROUP F 


JENNIFER TABACK AND SHARIF YOUNES 


Abstract. The definition of graph automatic groups by Kharlampovich, Khoussainov and Mi- 
asnikov and its extension to C-graph automatic by Murray Elder and the first author raise the 
question of whether Thompson’s group F is graph automatic. We define a language of normal 
forms based on the combinatorial “caret types” which arise when elements of F are considered as 
pairs of finite rooted binary trees, which we show to be accepted by a finite state machine with 2 
counters, and forms the basis of a 3-counter graph automatic structure for the group. 


1. Introduction 

It is not known whether Thompson’s group F is automatic; F has some characteristics of an auto¬ 
matic group such as quadratic Dehn function, type EPoo and word problem solvable in 0(n log n) 
time (see [TO], [2| and [T3|, respectively). Yet an automatic structure for the group remains elusive. 
Cuba and Sapir present a regular language of normal forms for elements of F in m, and Cleary 
and the first author show that F has no regular language of geodesics in [5|. 

With the extension of the notion of an automatic group to a graph automatic group by Kharlam¬ 
povich, Khoussainov and Miasnikov in [T2| , and further to a C-graph automatic group by Elder and 
the first author in [8], a natural question is whether F is captured by one of these larger classes of 
groups. To summarize these definitions for a group G with finite generating set S: 

(1) G has an automatic structure if there is a normal form for group elements that is recognized 
by a finite state machine, as well as finite state machines Mg that accept the language of 
pairs {g,gs), for each s £ S. The machines Mg are called the multiplier automata. 

(2) G has a graph automatic structure if there is a finite symbol alphabet used to define a 
normal form for group elements that is recognized by a finite state machine, as well as finite 
state machines Mg that accept pairs of normal form words corresponding to group elements 
that differ by s, for each s £ S. 

(3) G has a C-graph automatic structure, where C is a class of languages—for example regular, 
context free, counter or context-sensitive—and all languages in the previous definition are 
now allowed to lie in the class C. 

The introduction of a symbol alphabet used to express normal forms for group elements seems 
quite suited to Thompson’s group F. If group elements are expressed as reduced pairs of finite 
rooted binary trees, there is combinatorial information that can be used to express the group 
element uniquely. Namely, these trees are constructed from nodes, or carets, and there are several 
descriptions of caret types given in the literature that depend on the placement of the carets within 
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the tree. We follow those given by Elder, Fusy and Rechnitzer in [6] and dehne a quasi-geodesic 
normal form for elements of F. 

We show below that F is not graph automatic with respect to this natural normal form language 
over this set of symbols. This does not rule out the result with respect to a different normal form, 
or an alternate symbol alphabet. However, any symbol alphabet which captures the tree structure 
in any way seems unlikely to the authors to yield the result that F is graph automatic. 

In [7], Elder and the first author show that the standard infinite normal form for elements of F is 
the basis of a 1-counter graph automatic structure. The geodesic normal form we construct below 
based on caret types yields a 3-counter graph automatic structure for the group. This could be 
improved to a 2-counter graph automatic structure, but the increase in complexity of the argument 
did not balance out the decrease in counters, given the result in [7]. 

We remark that a third possibility for constructing a C-graph automatic structure for F would 
be to use the regular normal form language of Cuba and Sapir m- In preparing this article 
we considered the complexity of the multiplier automata for this language and found that the xi 
multiplier automaton would require at least 4 counters, and hence the resulting structure is less 
efficient overall than the structures obtained in [7] and this paper. 

The paper is organized as follows. In Section [2] we recall the definition of C-graph automatic 
groups introduced in [8] which generalizes the definition of graph automatic groups introduced by 
Kharlampovich, Khoussainov and Miasnikov in m- In Section [3] we present relevant background 
on Thompson’s group F. In Sections [4] and [5] we show that the language based on the symbol 
alphabet consisting of caret types forms a 3-counter graph automatic structure for F. 

2. Background on Generalizations of Automaticity 

There are many definitions in the literature of counter automata; we begin with our definition. 

Definition 2.1 (counter automaton). A nonblind deterministic fc-counter automaton is a deter¬ 
ministic finite state automaton augmented with k integer counters: these are all initialized to zero, 
and can be incremented, decremented, compared to zero and set to zero during operation. For each 
configuration of the machine and subsequent input letter, there is at most one possible move. The 
automaton accepts a word exactly if upon reading the word it reaches an accepting state with all 
counters returned to zero. 

In drawing a counter automaton, we label transitions by the input letter to be read, with subscript 
to denote the possible counter instructions: 

• = 0 to indicate the edge may only be traversed if the value of the counter is 0. 

• -|-1 to increment the counter by I. 

• —1 to decrement the counter by 1. 

In this paper we focus entirely on counter-graph automatic structures, which will be defined below. 
The class of counter languages is closed under homomorphism, inverse homomorphism, intersection 
with regular languages, and finite intersection (see m, for example). Moreover, the intersection of 
a /c-counter language with a regular language is /c-counter, and the intersection of k- and /-counter 
languages is a (A: -|- /)-counter language, as proven in [8]. 

We introduce the notion of a convolution of strings in a language, following m, for ease of notation 
in the multiplier languages we later define. Let G be a group with symmetric generating set X, 
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and A a finite set of symbols. In general we do not assume that X is finite. The number of symbols 
(letters) in a word n G A* is denoted |n|A. 

Definition 2.2 (convolution; Definition 2.3 of |12]). Let A he a finite set of symbols, o a symbol not 
in A, and let Li,..., Lk be a finite set of languages over A. Aet Ao = Au{o}. De/ine f/ie convolution 
of a tuple (tci,..., Wk) ^ Li x ■ ■ ■ x to be the string (8>(rci,..., Wk) of length max(|rcj|A) over the 
alphabet [Afi)^ as follows. The ith symbol of the string is 

( 

\ Afc 

where Xj is the ith letter of wj if i < \wj\A and o otherwise. Then 

(8)(Li, ...,Lk) = {<8)(tci, ...,Wk) \wiG Li} . 

As an example, if wi = aa, W 2 = bbb and W 3 = a then 

/ a \ / a 

\a J \ o 

When Li = A* for all i the definition in [1^ is recovered. 

We require that the normal form language on which a C-graph automatic structure is based be 
quasigeodesic. 

Definition 2.3 (quasigeodesic normal form). A normal form for (G,X,A) is a set of words L C A* 
in bijection with G. A normal form L is quasigeodesic if there is a constant D so that 

kU < D{\\u\\x + 1) 

for each u ^ L, where ||u||x is the length of a geodesic in X* for the group element represented by 

u. 



The ||tt||A: + l in the definition allows for normal forms where the identity of the group is represented 
by a nonempty string of length at most D. We denote the image of n G L under the bijection with 
G by n. 

We now state the definition of a C-graph automatic group, following [5], which generalizes the 
notion of a graph automatic group introduced in [12]. 

Definition 2.4 (C-graph automatic group). Let C be a formal language class, {G,X) a group and 
symmetric generating set, and A a finite set of symbols. We say that (G, X, A) is C-graph automatic 
if there is a normal form L C A* in the language class C, such that for each x X the language 
Lx = {(8)(tt,n) I n, u G L,T =g ux} is in the class C. 

The following observation is proven in |8|. 

Lemma 2.5 (Lemma 2.5 of [8]). Let Li and L 2 be k- and l-counter languages respectively. Then 
®{Li,L 2 ) is a {k + l)-counter language. 

We end this section with a lemma describing when certain languages of convolutions are regular; 
this lemma will be used to streamline many of the proofs in later sections. 

Lemma 2.6. Fix a symbol alphabet A and let ai,a 2 ,bi,b 2 be fixed (possibly empty) words in A*. 
Then the following languages are all regular. 
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(1) Li = {^{aiw, aibiw) \ w G A*} 

(2) L 2 = {'Si{aiw,biw) I w G A*} and L 3 = {®{wa 2 ,wb 2 ) \ w G A*} 

(3) L 4 = {^{wiaiW 2 a 2 W 3 ,WibiW 2 b 2 W 3 \ Wi,W 2 ,W 3 G A*} 

Proof of (1). Since {®(ai,ai)} is regular and the set of regular languages is closed under concate¬ 
nation, this problem reduces to recognizing the language 

L = {( 8 >(rc, bw)} 

with a finite state automaton. 

Let | 6 | = p. Enumerate all strings of length p in A*: si, S 2 , S 3 ,, Sn. For each Si, create a state Si 
and add a path of p edges from the start state qo to Si labeled by the successive letters in ( 8 >(sj, b). 
Next, for each pair (Ai,A 2 ) G A x A add an edge from Si —)• Sj labeled (Ai,A 2 ) if and only if 
Si = A 2 a: and sj = xAi for x G A*. Finally, for each Sj, construct a path from Si to an accept state 
F labeled by ( 8 i(e, Si) = (o^, Si). 

The edge labels ensure that if a prefix of n) describes a path from the start state to state Si, 
then the last p letters read from u are exactly the string s*. Reading the next letter of ®(u, v), say 
(P, to transition to state Sj means that the final p letters now read from u are Sj and that the 
initial letter of Sj was y. In this way we check that the string bw contains the string w shifted p 
places by the insertion of b. 

Cases ( 2 ) and (3) are proven by similar arguments and we omit them here. □ 

3. Background on Thompson’s group F 
Thompson’s group F can be equivalently viewed from three perspectives: 

( 1 ) as the group defined by the infinite presentation 

T^inf = xi,X 2 ■ ■ ■ \xjXi = XjXj+i whenever i < j) 
or the finite presentation 

Vfin = {xo,Xi\[Xq^Xi,Xq^XiXo],[Xq^Xi,Xq‘^XixI]). 

( 2 ) as the set of piecewise linear homeomorphisms of the interval [ 0 , 1 ] satisfying 

(a) each homeomorphism has finitely many linear pieces, 

(b) all breakpoints have coordinates which are dyadic rationals, and 

(c) all slopes are powers of two. 

(3) as the set of pairs of finite rooted binary trees with the same number of nodes, or carets. 

For the equivalence of these three interpretations of this group, as well as a more complete intro¬ 
duction to the group, we refer the reader to [3]. If 5 = {T, S) is an element of F given as a pair of 
finite rooted binary trees, then the tree T determines the linear intervals in the domain of g, and 
the tree S determines the linear intervals in the range; again see [3] for details. 

3.1. Terminology and notation. In this section we define the terminology that we use in creating 
the 2-counter language of normal forms for elements of F. A caret consists of a vertex in a tree T 
together with two edges that we draw with a downward orientation. A vertex in the tree T 

( 1 ) is called a /ea/iff it has valence one; 

( 2 ) is the root of the tree iff it has valence two; and 
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(3) has valence three otherwise. 


A caret in T has a left edge and a right edge; we refer to a caret D as the left (resp. right) child 
of caret C if the vertex of D is the endpoint of the left (resp. right) edge of C. A caret is called 
exterior if at least one of its edges lies on the left or right side of the tree, and interior otherwise. 
A caret is exposed if it has no children. 

Let (r, S) denote a pair of finite rooted binary trees with the same number of carets, which nec¬ 
essarily corresponds to a unique element of Thompson’s group F. We refer to this as a tree pair 
diagram representing the element. The carets of T and S are numbered in increasing infix order: 
beginning with the leftmost descendant of the root caret, the left child of any caret is numbered 
before the caret, and the right child is numbered after it. We then pair carets with the same inhx 
number from the two trees and refer to a caret pair with caret number n. 

3.2. Reduction criterion. To ensure a bijective correspondence between elements of F and pairs 
of finite rooted binary trees with the same number of carets, we impose an additional requirement. 
Viewing an element of T as a piecewise linear homeomorphism of the interval, subject to the above 
conditions, we can always trivially add additional breakpoints within a linear component without 
altering the function. However, such breakpoints are superfluous. We define a reduced tree pair 
diagram to correspond to a function without unnecessary breakpoints. Exposed carets with identical 
inhx numbers in both trees introduce a superhuous breakpoint to the analytic representation of a 
group element. By removing these unnecessary caret pairs we obtain a reduced tree pair diagram. 
An element g ^ F \s thus represented by an equivalence class of tree pair diagrams with a unique 
reduced diagram. When we refer to a tree pair diagram representing a group element, we mean the 
reduced diagram unless otherwise indicated. 

3.3. Multiplication of tree pair diagrams. Multiplication of elements of F when viewed as 
piecewise linear homeomorphisms of the interval [0,1] is simply composition of functions; “mul¬ 
tiplication” of tree pair diagrams is a combinatorial construction which mimics composition of 
functions. Consider g = (Ti,T 2 ) and h = (S'i,S' 2 ) in F; create unreduced representatives of both 
g and h, which we denote by {T{,T 2 ) and {S{,S 2 ) respectively, so that = S{. The product hg 
is then represented by the possibly unreduced tree pair diagram (Tl,^^). For explicit examples of 
multiplication of elements of F we refer the reader to [Ij. 

Multiplication by the generators xq and xi and their inverses induce particular combinatorial 
rearrangements of a given tree pair diagram which are explicitly described in Section [5l 

3.4. Caret labels. In Section |4] below we develop a set of normal forms for elements of F using 
an alphabet consisting of caret types or labels. We give each caret a label based on its position in 
the tree, and form these strings into pairs by inhx order. 

There are several existing sets of caret labels in the literature; Fordham in [9] uses caret labels to 
assign weights to each caret pair. He then proves that the sum of the weights corresponding to 
a (reduced) element is exactly its word length with respect to the hnite generating set {xo,xi}. 
Unfortunately, Fordham’s caret labels are not in one-to-one correspondence with conhgurations of 
carets in a tree, as illustrated in Figure [H Fordham dehnes a caret to be interior if neither edge 
comprising the caret lies on the left or right side of the tree, and further subdivides these carets 
into type /q, which have no right child, and type Ir, which do. The configurations of carets in 
Figure [U both correspond to the string of labels Ir, Iq,Iq, where the labels are listed in inhx order. 

We rehne Fordham’s dehnition of interior caret types to avoid the above problem, and condense 
his left and right caret types (comprising carets with an edge on either the left or right side of 
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Ir, Iq, h 


Ir, Io,Io 


Figure 1. Fordham’s caret labels are not in bijective correspondence with binary 
trees: two different interior subtrees corresponding to the same string of labels. 

Labels are listed in infix order. 

the tree) into a single exterior type, while distinguishing the root caret. Define the following caret 
types within a finite rooted binary tree: 

(1) the root caret is labeled “r”, 

(2) all other exterior carets are labeled “e”, 

(3) interior carets are labeled as follows: 


Caret C 

is the left child of an 
interior caret or the 
child of an exterior caret 

is not the left child of an 
interior caret nor the 
child of an exterior caret 

has a right child 

( 

b 

has no right child 

a 

) 


Listing the caret types of the carets in a binary tree in infix order yields a word over the alphabet 
Dr = {r,e, (, ),a,b}. For example, the tree in Figure [2] is encoded by the word eea()(a6)raee. 

These caret labels are easily translated into those used by Elder, Fusy and Reichnitzer in [6] (which 
are based on diagrams in my, their labels mark only interior subtrees. Leaves are labeled “N” or 
“I” and carets are labeled “n” or “i” as in Figured 

There is a straightforward translation between the notation in [6] and our own: 

(1) If a caret is labeled “n,” the caret is a left child or else the root caret; 

(2) if the caret is labeled “i,” then the caret is a right child. 

(3) If the leaf directly to the right of a caret is labeled “N,” then the caret has a right child; if 

the leaf directly right of a caret is labeled “I,” then the caret does not. 

Following these rules, replace any appearance of “nN” with “(”, “il” with “)”, “nl” with “a”, and 

“iN” with “5”. 

4. A 2-counter language of normal forms for elements of F 

We now define a language F of normal forms for elements of F based on the caret types defined 
in Section f3.4l which is accepted by a nonblind deterministic 2-counter automaton. This forms the 
basis of a 3-counter graph automatic structure for F ; the additional counter is required to construct 
the multiplier automaton for xi. 
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Figure 2. An example of a tree corresponding to the string of labels eea{){ab)raee. 



Figure 3. Leaf and caret labels used in [6]. Leaves are labeled “N” or 
carets are labeled “n” or “i.” By convention, the root caret is labeled “n” 
left-most leaf is not labeled. 

Theorem 4.1. Thompson’s group F is nonblind deterministic ^-counter-graph 
quasigeodesic normal form with respect to the generating set S = {xo,xi} and the 
T^t = {e, r, (,), a, 6} of caret types. 

This theorem is proved in the subsequent sections. Moreover, we show that the language of normal 
forms defined over this symbol alphabet can never be accepted by a 1-counter automaton. 

4.1. Word Acceptor. The language F is constructed in stages, always taking the symbol alphabet 
to consist of the caret labels {e, r, (,), a, b}. 

(1) We first define a 1-counter language Lint containing all strings that uniquely describe pos¬ 
sible interior subtrees of a finite rooted binary tree. Exterior caret labels are not used. 

(2) Next we extend Lint to a 1-counter language Ltree whose strings uniquely describe finite 
rooted binary trees. 

(3) We define a 2-counter language Lt which is the intersection of ®{Ltree, Ltree) with a regular 
language ensuring that no o symbols appear in any convolutions. This language describes 
all pairs of finite rooted binary trees with the same number of carets. 

(4) Finally, we check that strings in Lt correspond to reduced pairs of binary trees by intersect¬ 
ing with a regular language R which checks a simple reduction criterion. Then F = Lt H R. 


“I” and 
and the 


automatic with 
symbol alphabet 
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4.1.1. A language describing interior subtrees. In [6] a simple criterion is given for when a string 
of caret labels consisting of “AI", “I", “n" and “i" (as defined in Figure [3]) represents an interior 
subtree of a finite rooted binary tree. We restate this lemma using the alphabet T,int = {(,), a, 6} 
of interior caret labels. A translation between the two sets of labels is given in Section 13.41 

Lemma 4.2 (Lemma 6 of [^). Let w = si ■ ■ ■ Sn be a word in (Sj„t)*. Let Wj denote the prefix 
Si • • • Sj and let \u\y denote the number of occurrences of “y" in u. Then w encodes a (possibly 
empty) interior subtree if and only if the following conditions hold: 

(1) For any 0 < j < n such that Sj+i E {(,o}; 

(2) |u;|( = |in|). 

Let Lint denote the set of all strings satisfying the conditions in Lemma 14.21 (including the empty 
string). Then Lint is a nonblind deterministic 1-counter language accepted by a machine where 
the input is read from left to right and the counter instructions are as follows: 

( 1 ) the counter increments by 1 after reading “("; 

( 2 ) read “)" only when the value of the counter is non-zero, and subsequently decrement the 
counter by 1 ; 

(3) proceed to a fail state if the counter is zero and “6" is read. 

These counter instructions correspond exactly to the following arrangement of carets in a binary 
tree. A caret with label corresponds to the root caret of an interior subtree with a nonempty 
right subtree; there will be a corresponding caret labeled which is the final caret (in infix order) 
of this subtree. 

As every element in Lint corresponds to a possible interior subtree of a finite rooted binary tree, we 
can formalize this identification by defining a mapping Tint ■ Lint {interior subtrees}. To show 
that Tint is well-defined we must verify that any word w E Lint encodes a unique interior subtree. 
This is easily checked by considering all combinations of pairs of caret labels in T,int = {(; )> 
verifying that each combination precisely defines where the corresponding carets must be positioned 
in the interior subtree. A chart verifying this is included in the Appendix. If a E Lint is a string 
of caret types from Tiint, we build a unique interior subtree as follows. Each adjacent pair of caret 
types beginning on the left side of the string uniquely determines the relative placement of the 
carets in the subtree being constructed as shown in Figure [71 Hence Tint is a bijection between the 
set of possible interior subtrees of a finite rooted binary tree and the set of words in the language 
Lint- 

4.1.2. A language describing binary trees. Next we construct a language Ltree over the alphabet 

St = {e,r, {,),a,b} 

which accepts exactly those words corresponding to finite rooted binary trees. Let Ktree denote the 
machine which accepts Ltree- A binary tree containing no interior carets will correspond to a word 
of the form where rei,n 2 E NUjOj. In general, a binary tree has interior subtrees attached 

to any collection of exterior carets. Thus, a string w in S*^^ describing such a tree must satisfy the 
following conditions: 

( 1 ) w begins with either “e'' or “r", 

( 2 ) w ends with either “e'' or “r", 

(3) w contains exactly one “r", and 

(4) for any substring Si-iSt - - - SjSj+i, if Sj_i, Sj+i E {e, rj and Si - - - Sj E S*^^, then Sj • • • Sj is 
a word in Lint- 


Let Ltree be the set of words satisfying conditions (1) through (4). Since Lint is a 1-counter language, 
condition (4) can be checked with a single counter, as follows. Note that substrings Si-iSi ■ ■ ■ SjSj+i 
and Sk-iSk ■ ■ ■ sisi^i may overlap only at the first and final letters, which lie in {e, r}. Upon reading 
“e" or “r" with a subsequent letter from a machine scanning the word simulates the machine 
for Lint on the substring s* • • • SjSj, and repeats this process when it encounters the next letter from 
{e,r}. 

The three remaining conditions are easily be recognized with finite state automata. Hence Ltree is 
a nonblind deterministic 1-counter language. 

We can now extend the function Tint ■ Lint {interior subtrees} to a bijection 

Ttree ■ Ltree ^ {finite rooted binary trees}. 

4.1.3. A language describing pairs of binary trees. To describe pairs of binary trees with the same 
number of carets, we use the convolution of two strings from Ltree which are the same length. 
Namely, Lt is the intersection of 0{Ltreej Ltree) with a regular language of convolutions of elements 
of Sy containing no o symbols. It follows from Lemma 12.51 that Lt is a 2-counter language. 
Analogously, there is a bijection 

r : Lt —>■ {pairs of finite rooted binary trees}. 

If ®{u,v) € Lt then the top line in the convolution determines the first tree in the pair, and the 
bottom line the second tree in the pair. Let Mt denote the machine which accepts Lt- 

4.1.4. Reduction Criterion. Next we test whether a string in Lt corresponds to a reduced pair of 
trees. We first observe sequences of labels which must appear in a string corresponding to a tree 
pair diagram that is unreduced. For each sequence, we intersect Lt with the complement of the 
corresponding regular language, using the fact that the set of regular languages is closed under 
complement at ion. 

Given a finite rooted binary tree T, it is easily checked that: 

(1) the first exterior caret of T is exposed iff the word encoding it begins with “ee'' or “er"; 

(2) the last exterior caret of T is exposed iff the word encoding it ends with “re" or “ee"; 

(3) a symbol “y" encodes an exposed interior caret of T iff y G {), a} and is preceded by some 
X € {e,r, {,b}. Recall that “)" and “a" are the two interior caret types which label carets 
that have no right child. 

There are three ways that a pair of binary trees representing a non-identity element of F may be 
unreduced: 

(1) the first exterior caret of each tree is exposed; this occurs iff the word in Lt describing the 
pair of trees begins with (g) for yt € {e,r}. 

(2) the last exterior caret of each tree is exposed; this occurs iff the word in Lt describing the 

pair of trees ends with: (g) for Xi € {e, r}. 

(3) each tree has an exposed interior caret with infix number n; this occurs iff the word in Lt 

describing the pair of trees contains any of the following strings: for Xi G {e, r, (, b} 

and yi G {),«}, where the yt represent the exposed interior carets. 

Each of the above cases can be detected with an finite state machine. Let R denote the set of all 
strings that do not contain any of these sequences. Then R is regular and, moreover, R consists of 
exactly those strings that do not fail any of the three reduction criteria. Let F = Lt H R. Since 
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each word in Lt encodes a pair of binary trees with the same number of carets, each word in T 
encodes a reduced pair of binary trees, that is, a unique element of F. Furthermore, because Lt is 
a 2 -counter language, it follows that F is as well. 

We can now restrict the bijection r to obtain another bijection 

Tred '■ ^ {reduced pairs of finite rooted binary trees}. 

Thus we have shown that W is a 2-counter language which is a normal form for Thompson’s group 
F. 

Let V = T~^. If {T, S) is a pair of binary trees, then v{T,S) is the string in Lt representing the 
caret types from the pair. We abuse this notation in several ways. 11 g = {T,S), we write iy'{g) 
interchangeably with z/(T, S'). 11 g = (T, S) is reduced, then we understand that i'{g) = T~^{g) = 
Fedid)- Thus in this case v{g) G F. 

We now show that the language T” is a quasigeodesic normal form for F. 

Proposition 4.3. The normal form language F for Thompson’s group F is quasigeodesic. 

Proof. It follows from work of Fordham [9| that the number of carets in either tree in a reduced 
tree pair diagram for g F is coarsely equivalent to the word length of g with respect to the finite 
generating set {xq, xij. The length of an element of F is exactly the number of carets in either tree 
in the reduced tree pair diagram representing g ^ F, hence W is a quasigeodesic normal form. □ 

We next show that the language F of normal forms is not a context-free language. Since 1 -counter 
languages form a subset of the set of context-free languages it follows that we can not improve on 
Theorem O We prove Lemma 14.51 using Ogden’s Lemma, which is a corollary to the standard 
pumping lemma for context-free languages, and is stated below. 

Lemma 4.4 (Ogden’s Lemma, [H] lemma 6.2). Let L be a context free language. Then there is a 
constant p so that if z is any word in L, and we mark any p or more positions of z as distinguished, 
then we can write z = uvwxy so that 

(1) V and X together have at least one distinguished position, 

(2) vwx has at most p distinguished positions, and 

(3) for all i >11 we have uv^wx^y G L. 

Ogden’s Lemma allows us to omit the first letter of any string in the following proof from the 
substrings v and x in the application of the lemma, an assumption one cannot make when applying 
the standard pumping lemma for context-free languages. 

Lemma 4.5. Let F be as above. Then F is not context-free. 

Proof. Assume for contradiction that F is context-free. Let p be the constant guaranteed by 
Ogden’s Lemma, and consider the convolution 



which lies in F. Mark all but the first and last symbol as distinguished, and apply Ogden’s Lemma. 
As V and x together must have at least one distinguished position, one of these strings must contain 
either “(" or “)" in either the top or bottom line. First suppose vx contains “(" in the top line. 
Then vx contains a symbol from the substring 
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However, since |nr(;x| < p and |('()^| = p, it follows that vx cannot contain the symbol in the 
top line. An almost identical argument shows that if vx contains in the bottom line, it cannot 
also contain in the bottom line. In either case, uv^wx^y ^ J- for i ^ 1, and hence J- is not 
context free. □ 


5. Multiplier languages for the 3-counter graph automatic structure 


We now construct the automata Mx^ and Mx ^, which accept the multiplier languages 

Lxq = {0(u, v)\u, n G W, uxq = n} 


and 


Lxx = {'S!{u,v)\u,v € W, uxi = n}. 

As noted in [8], when constructing C-graph automatic groups, a multiplier automaton which reads 
the convolution (2)(u,n) must both check that u and v lie in the normal form language and that 
uxi = V. This is not an issue for either automatic or graph automatic groups; in both of these 
scenarios, the normal form language is regular, and since intersections of regular languages are 
regular it is not necessary for the multiplier automata to check membership in the normal form 
language. As our normal form languages are not regular, the multiplier automata must also verify 
this membership condition. The language Lxq is accepted by a nonblind deterministic 2-counter 
automaton, while Lx^ is accepted by a nonblind deterministic 3-counter automaton. 


It is proven in [8] that for M = {regular languages} or = {1-counter languages} we have Lx ^ M 
(resp. if and only if G (resp. ^i). Hence it suffices to consider only the multiplier 
languages Lxq and Lx^. 


5.1. The multiplier language Lxq- Multiplication ol g = {T,S) by xq induces a particular 
rearrangement of the subtrees of the original diagram, as shown in Figure |4] when the root caret 
of T has at least one left child. To compute this product, the tree pair diagram for xq, which 
corresponds to (8)(re, er) G W, is placed to the left of the tree pair diagram from g. Then (possibly) 
unreduced representatives of the two elements are created in which the middle trees are identical; 
this may entail adding an additional caret as a left child to caret 0 in both T and S. The outermost 
trees together form the product gxQ. The final result of any multiplication of tree pair diagrams is 
a unique reduced tree pair diagram, but this may include an intermediate stage where the resulting 
pair of trees is not reduced. 


T from g = (T, S) T' from gxo = (T', S') 



Figure 4. Part of the tree pair diagrams for g = (T, S') and gxo = {T',S'). The 
letters A, B, and C represent possibly empty subtrees, and S = S'. 

Proposition 5.1. The multiplier language for the generator xq defined by 

Lxq = {<8)(u, n)|n, u G W and uxq =fv} 

is recognized by a nonblind deterministic 2-counter automaton. 

II 








Proof. The language L^q will be the union of three sub-languages, based on the analysis of multi¬ 
plication by xq given below. 

Let g = (T, S) as a reduced pair of trees. Recall that v{g) G T” is the convolution of strings of caret 
types describing the trees S and T. To describe u = v{g) and v{gxo) we divide into three cases 
depending on whether the root caret of T has no left child, no right child, or has children on both 
sides. In terms of u = ^{g)., these cases correspond, respectively, to 

( 1 ) an initial letter of “r'' in the top line of u when written as a convolution; 

( 2 ) initial letter of and final letter “r'' in the top line of u when written as a convolution; 

(3) neither of the above conditions. 

In each case we obtain a unique string in T which corresponds to gxQ. 

Case 1: the root caret of T has no left child. To perform the multiplication by xq a caret must 
be added to the left leaf of the root caret in T and to the left leaf of caret 0 in S, creating an 
unreduced representative g* = {T*,S*). Let gxo = {T',S') denote the product as a reduced tree 
pair diagram. 

In this case we must have v{g) = (^ 2 ) some zq G {e, r} and 71,72 G and using the 

combinatorial rearrangement of carets induced by multiplication by xq, we can write 

• Ha) = Oil), 

•U9-) = (:)(;)©. and 

•n(9n„) = 0(;)7). 

Notice that this includes g being the identity, in which case iy{g) = 

Case 2: the root caret of T does have a left child and the following 

(i) the root caret of T has no right child, 

(ii) the left child of the root caret of T has no right child, and 

(iii) the last exterior caret of S is exposed. 

Then multiplication by xq yields an unreduced representative {gxo)* 
a redundant caret from both T* and S* to obtain gxQ = (T', S'). 

Given conditions (i), (ii) and (iii), the string v{g) must end with the symbols (g) for some 
zq G {e, r}. Hence the string representing the unreduced {gx^)* = (T*, S*) must ends in (J^) (g). To 
obtain the reduced tree pair diagram for the product, the final redundant caret must be eliminated 
from the tree pair diagram (r*,5*); specifically, the right child of the root in T* has two exposed 
leaves, as does the final (rightmost) caret in S* . These are removed to obtain the reduced tree pair 
diagram {T',S') for gxQ. 

Thus, for some 71,72 G and zq G {e, r} we have 

•-(5) = 0U)(D, 

• H9x*o) = CjDOO, and 

• ty{gxo) = ©(;j. 


(r) and iy{gxo) = Q (^). 

three conditions hold 


= (T*, S*). We must eliminate 
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Case 3; the element g = (T, S) does not belong to Case 1 or Case 2. 

In this case, the rearrangement of the subtrees of the domain tree is exactly as depicted in Figured 
Multiplication has no effect on the range tree: that is, S = S'. Let € Lmt denote the 

substrings that encode the subtrees A,B, and C, respectively and xa,Xb,Xc € the strings of 
labels for the carets in S paired, respectively, with the carets in subtrees A, B and C of T. Then 

•K9) = ey{:)Qy (;)©“<* 

• = (-)(;)©(;) Co) 

for some Zo,zi G T,t- 

Define a language A/q to consist of the union of all convolutions ®{u,v) of the following forms: 

• “=(i;) t'o) (D “d “ = (5) (i) 

• “ = & C) & Q -d „ = (2) (;) (-) (i) (2) 

where zq,zi € and Xi ^ Note that u and v may no longer lie in B or even Lt- 

It follows from Lemma 12.61 that these three sub-languages are regular, and hence their union A/q 
is regular as well. The language is then the intersection of A/q with the 2-counter language 
{(8i(u, v)\u G T, V G because the construction of A/q guarantees that ifu G B and (8)(u, v) G A/q, 
then the string v lies in A. Hence the language Lxq can be recognized by a nonblind deterministic 
2-counter automaton. □ 

5.2. The multiplier language If 5 = (T, S) G F, then the process of multiplying g by xi is 
described in five cases which depend on the arrangement of carets in T and S. 

In this section, if <7 = (T,S) is as in Figure [5l we will use Xa-,1b-,1c and xd to denote the strings 
in Lint that encode the subtrees A,B,C and D, respectively. Let xa,Xb,Xc and xd denote the 
strings in that encode the carets of S which are paired, respectively, with subtrees A, B, C and 
D. If xr is empty for R G {A, B, C, D} then we say that XR is empty as well. 

T from g = (T, S) T' from gxi = {T', S') 



Figure 5. If g = {T,S) and gxi = {T',S'), where the tree T contains at least a 
root caret whose right child has a left child, the combinatorial rearrangement of the 
subtrees of T yielding T'. Here, A,B,C and D represent possibly empty subtrees 
and S = S'. 

We now identify all pairs of strings in F which correspond to possible pairs g and gxi. In Cases 
(1) through (4) below the difference between iy(g) and i'{gxi) lies either at the end of the string 
or immediately after the unique caret of type in the top line of the convolution y{g). A finite 
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state machine can easily detect this change. Then two counters are required to ensure that ^{g) 
lies in and by the construction of the language, v{gxi) must lie in F as well. Thus the set of 
all convolutions described in Cases (1) through (4) will form a nonblind deterministic 2-counter 
language. The set of strings described in the final case of this section will require an additional 
counter to detect that the change between ^{g) and v{gxi) lies at the correct position in the interior 
of the convolution. 

Case 1; Two carets must be added io g = (T, S) in order to multiply by xi. Suppose first that the 
root caret of T has no right child. Then we must add a right child with a left child to the rightmost 
carets of T and S, creating an unreduced representative of g, before performing the multiplication. 
Let g* = {T*,S*) denote this unreduced representative for g. 

Since the root caret of T has no right child, the string corresponding to T ends with “r". Then for 
some zq € {e,r}, we have 

• -(!?) = Cl) Co) > 

• Ha*) = Cl)Co)(a)D’ and 

• Haxi) = CDdJDD- 

Notice that this case also includes the identity with "Ja = XA = e and zq = r. Then ^{xi) = 

QOO- 

Case 2; One caret must be added to g = (T, S) in order to multiply by xi. Suppose that the root 
caret of T has a right child which does not have a left child, so that the string in corresponding 
to T is XAxeXD- To perform multiplication by xi, we must add a left child to the right child of 
the root caret of T and to the corresponding leaf in S, creating an unreduced representative of g, 
which we denote g* = (T*, S*). We consider three possibilities for this multiplication, based on the 
caret type zq of the caret in S paired with the root caret of T. Let n denote the infix number of 
the root caret in T and the corresponding caret of type zq in S. 

In subcases (a)-(c) below, we always have 



where Zo,zi G Sy. 

(a) First suppose zq G {e,r, (,6)}. Then the new caret is added to S as the left child of the 
caret of infix number n -|- 1 and will be type in S* = S'. Then we have 

(b) Next suppose zq is type “a". Then the new caret is added to S as the right child of caret 
n (of type zq), which changes they type of caret n in 5 to “(", and the new caret is of type 

Then we have 

(c) Finally, suppose zg is type As in (b), the new caret is added to S as the right child of 
caret n, which changes to type “b", and the new caret is of type “)". Then we have: 

•U9') = e)C)(p(;)(s),».id 
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In all three cases, i^{gxi) is a reduced string. If xd is empty, then z\ e because v{g) is itself a 
reduced string. Note that for a given string ^{g) in Case 2, a finite state automaton can decide 
which subcase i'{g) lies in. 

Case 3: After multiplication by xi, two carets must be removed to obtain the reduced tree pair 
diagram for the product. Suppose that the root caret of T has a right child which has a left child. 
If the following three conditions hold 

(i) the right child of the root caret of T does not have a right child; 

(ii) the left child of the right child of the root caret of T has no children; and 

(iii) the final two carets of S are exterior carets with no interior children 

then multiplication yields an unreduced representative gx\ = (T*,S*). We must eliminate two 
redundant carets from each of T* and S* to obtain gxi = (T', S'). 

Given conditions (i) and (ii), the string corresponding to T ends with ^^rae". Condition (iii) implies 
that the string corresponding to S ends with z^ee for zq G {r,e}. Then we have 

Case 4: After multiplication by xi, one caret must be removed to obtain the reduced tree pair 
diagram for the product. Suppose that the string w does not lie in Cases I, 2, or 3. Suppose 
further that the root caret of T has a right child that has a left child. If the following three 
conditions hold 

(i) the right child of the root caret of T does not have a right child; 

(ii) the left child of the right child of the root caret of T does not have a right child (but may 
have a left child); and 

(iii) the final caret of S is an exterior caret with no left child 

then multiplication yields an unreduced representative gx\ = {T*,S*). We must eliminate a single 
redundant caret from each of T* and S* to obtain gxi = (T', S'). 

It follows from conditions (i) and (ii) that the string in corresponding to T is Then 

for zq G T,t and zi G {e, r} we have 

•"(S')"©©)©©©- 

= ©(;)©©©. a-d 

To avoid overlap with Case (3), if xb = 0 then zq ^ {e, r} to ensure that there is only one redundant 
caret in ^{gxl). 

To summarize Cases (I) through (4), let Mi be the set of all (8i(u, v) where u = v{g) and v = v{gxi) 
are as listed in Cases (I) through (4), where Xi ^md Xi &re now allowed to be any strings in As 
with the construction of Lxq, we are not concerned with whether u,v ^ J-. Each possible difference 
between the strings u and v in the previous four cases is easily detected by a finite state machine, and 
thus the language Mi is regular. Finally, intersect Mi with the language {(8)(x,y) | x G J”, y € 
where the latter is a nonblind deterministic 2-counter language. Denote the intersection by M] it is 
also a nonblind deterministic 2-counter language and consists of exactly those convolutions covered 
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in Cases (1) through (4). Note that it is enough to check that one of u and v lies in T because the 
construction of Mi ensures that when one component lies in the other one does as well. 

Case 5: No carets must be added to multiply by xi and the resulting tree pair diagram is necessarily 
reduced. That is, suppose that g = (T, S) does not belong to any of the previous cases. Then 
multiplication by xi does not affect the range tree; if 5 = {T,S) and gxi = then S = S'. 

The trees T and T' are depicted in Figure [5j We consider two cases according to whether the 
subtree C is empty, and note that a finite state machine can detect which case a convolution lies in. 
Note that ^ 2-counter language, because is regular and T requires two counters. 

In Case 5, there are at most two carets which change type between T and T'\ 

(1) The parent caret of subtrees B and C; this caret is interior in T and exterior in T', and we 
denote it d. This change always occurs. 

(2) The root caret of subtree C, which changes type if and only if C is not empty. We denote 
this caret by /. 


We show below that the xi multiplier language can be easily expressed using 3 counters. In this 
case, we switch our perspective and focus on the tree T' from the tree pair diagram for gxi rather 
than the tree T. 


Case 5(a): Subtree C is empty. In this case. 


u = i^{g) = 


lA 

XA 




IB 

XB 


Z\ V^2 


ID 

XD 


and 


V = iy{gxi) = 


7a \ ( r \ ( jB). ( e \ ( e \ ('jD 
oca) \ZoJ \Xb) \ZlJ \Z2j \XDy 
for some zo,zi,Z 2 € Let ICi be the regular language of all strings ( 8 )(tt,u) where u and v have 
the above form with 7 ^ and 7 c replaced by arbitrary strings from and 7 a, 7 d and xy replaced 
by arbitrary strings from for Y G {A, B, D}. As XB consists only of interior caret types, a finite 
state machine can easily check that after is read in v, the next symbol (p where x ^ {a, ^, (,)} 
is and that it is paired with in u. Assume without loss of generality that this condition 

is also verified in JCi. Then = /Ci n is a 2-counter language accepting exactly those 

convolutions 0 (n, u) in Case 5(a), that is, with (7 = 0. 


Case 5(b): Subtree (7 is not empty. In this case, in ^{gxi) write 7 c = gif 112 and in 12(g) write 
7 c = hi /^2 where /, resp. /, corresponds to the root caret of the subtree (7. Here gi G and 
we let g'l and g '2 in respectively denote the strings of caret types paired with the caret types in 
gi and r/ 2 - Then 


u = 12(g) 










and 

V = i2(gxi) = 

where zq, zi, Z2, Z3 G Sy and 






( 1 ) if the right subtree of the root caret of (7 is empty then / =) and / = a, or 

( 2 ) if the right subtree of the root caret of (7 is not empty then f = b and f = (■ 


Let JC 2 be the regular language of all strings ( 8 )(u,u) where u and v have the above form with 
all 7 s, gi and g 2 replaced by arbitrary strings from and 'yA,'yD,XY,gi and g '2 replaced by 
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arbitrary strings from for V € {A, B, L>}. As in Case 5(a), assume that the finite state 
machine recognizing JC 2 verifies that the change in the caret type of the parent caret of subtree 
B occurs in the appropriate position (that is, the first following the root caret in the string 
representing T'). 

To detect that the change in type from caret / to caret / occurs in the correct position in the string 
we will use a single counter. The counter instructions used to show that a string from forms 
a valid interior subtree were as follows: 

(1) When the machine reads “('' the counter is incremented by 1. 

(2) The machine may read only when the value of the counter is positive, and then the 
value of the counter is decremented by 1. 

(3) If the value of the counter is 0 and the letter “6" is read, the machine proceeds to a fail 
state. 

When these counter instructions are applied to a machine reading the value of the counter 

is 0 before each substring yy is read, for Y € {A,B,C,D}. Moreover, after the final letter of iji is 
read, the value of the counter is again 0. If r ]2 is nonempty, then upon reading “(" corresponding 
to the root caret of subtree C (caret /) the value of the counter is set to 1 and will not equal zero 
again until after the entire string r ]2 has been read by the machine. Hence in the Figure [6] we check 
that the value of the counter is strictly positive as 772 is read. 


(5) (i) (.hi 



Figure 6. For ^{u,v) as in Case 5, this machine checks that the change in the 
root caret of subtree C from T to T' occurs in the correct spot in the string and 
has the correct label change. The edge labels correspond to the letters in the string 
V = v{gxi) G F and the corresponding letter from u = v{g) is ignored, except at the 
root caret of subtree C, where both the letter from u and from v are considered. The 
letters zq,zi,Z 2 are as in the expressions for u and v above, y G T,int and Xi G Sy. 

The start state is go and the accept state is q^. 

The machine in Figure [6] reads convolutions <^{u,v) G /C 2 . The state transitions are determined 
solely by the letter from v, with one exception: the carets corresponding to the root caret of the 
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subtree C in T' (or T). In Figure [ 6 ] we label the state transitions only by the letter read in v except 
at this position. When v a J- as well, this machine works as follows. Let y G T,int and x G T,t be 
arbitrary letters. Let I denote the set of counter instructions given above. The label denotes 

the appropriate counter rules from 1 which apply to the letter for a,/3 G Sy. 

(1) The loop at state qo reads the string 7 ^ corresponding to subtree A in T' and the corre¬ 
sponding string XA from i/(gxi). The edge labeled (J^) can only be read when this subtree 
is completed. 

(2) The loop at state qi reads the string jb corresponding to subtree B in T' and the corre¬ 
sponding string xb from v{gxi). The edge labeled can only be read when this subtree 
is completed. 

(3) The loop at state q 2 reads the string gi corresponding to the left subtree of the root caret 
of subtree C in T' and the corresponding string g'^. We apply the counter instructions X to 
this part of the machine. After this string has been read, the value of the counter is 0. 

(4) If the right subtree of the root caret of subtree C is empty, then the root caret of C in 
v{gxi) has type in T' and the machine transitions to state gs verifying the counter 
value. As this corresponds to the root caret of C the machine simultaneously checks that 
the corresponding symbol from u = u{g) is (^) before transitioning to state q^. 

(5) If the right subtree of the root caret of subtree C in i'(gxi) is not empty, then the root 
caret of C has type in T' and the machine transitions to state q^ verifying the counter 
value. The machine simultaneously checks that the corresponding symbol from u = i'{g) is 

. The loop at state q^ reads the string g 2 corresponding to the right subtree of the root 
caret of C, using the counter instructions I but additionally verifying that the value of the 
counter is never 0 while this string is read. After the entire string r /2 is read the counter 
value is 0 , and this is verified along the edge leading to state 54 . 

( 6 ) The loop at state q^ reads the string xb corresponding to subtree D in T' and the corre¬ 
sponding string xd- 

Let C 2 be the intersection of the language accepted by the machine in Figure [ 6 ]with 0 (S^,T'); the 
latter verifies that u G T” (and hence rt G T”) as well. Then C 2 is the language of all convolutions 
accepted in Case 5(b); as it is the intersection of a 1 -counter language with a 2 -counter language, 
we conclude that C 2 is a 3-counter language. It follows that is the union of Ci and £ 2 , hence 
a nonblind deterministic 3-counter language. This completes the proof of Theorem 14.11 

6. APPENDIX 

Figure [ 7 ] below shows that the map t'_ defined in Section [ 5 ] is well defined. 
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Figure 7. This figure shows every possible pair of caret labels in T,'_ and the relative 
placement of the carets they encode. 
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